Home > Computers & Technology > Programming > Software Design, Testing & Engineering > Software Development

CUDA Fortran for Scientists and Engineers by Massimiliano Fatica & Gregory Ruetsch

Author:Massimiliano Fatica & Gregory Ruetsch , Date: March 26, 2014 ,Views: 514

CUDA Fortran for Scientists and Engineers by Massimiliano Fatica & Gregory Ruetsch

Author:Massimiliano Fatica & Gregory Ruetsch
Language: eng
Format: epub
ISBN: 9780124169722
Publisher: Elsevier Inc.
Published: 2013-09-15T16:00:00+00:00

3.5.2 Instruction-level parallelism

We have already seen an example of instruction-level parallelism in this book. In the transpose example of Section 3.4, a shared-memory tile of was used in most of the kernels. But because the maximum number of threads per block is 512 on certain devices, it is not possible to launch a kernel with threads per block. Instead, we have to use a thread block with fewer threads and have each thread process multiple elements. In the transpose case, blocks of threads were launched, with each thread processing four elements.

For the example in this section, we can modify the copy kernel to take advantage of instruction-level parallelism as follows:

Download

CUDA Fortran for Scientists and Engineers by Massimiliano Fatica & Gregory Ruetsch.epub

Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.

Categories

Coding Theory	Localization
Logic	Object-Oriented Design
Performance Optimization	Quality Control
Reengineering	Robohelp
Software Development	Software Reuse
Structured Design	Testing
Tools	UML

Popular ebooks

Deep Learning with Python by François Chollet(26128)
The Mikado Method by Ola Ellnestam Daniel Brolund(23443)
Hello! Python by Anthony Briggs(22578)
Secrets of the JavaScript Ninja by John Resig Bear Bibeault(21366)
Kotlin in Action by Dmitry Jemerov(20425)
Dependency Injection in .NET by Mark Seemann(20375)
The Well-Grounded Java Developer by Benjamin J. Evans Martijn Verburg(20264)
OCA Java SE 8 Programmer I Certification Guide by Mala Gupta(19439)
Algorithms of the Intelligent Web by Haralambos Marmanis;Dmitry Babenko(18248)
Grails in Action by Glen Smith Peter Ledbrook(17371)
Adobe Camera Raw For Digital Photographers Only by Rob Sheppard(16969)
Test-Driven iOS Development with Swift 4 by Dominik Hauser(11204)
Becoming a Dynamics 365 Finance and Supply Chain Solution Architect by Brent Dawson(8071)
Microservices with Go by Alexander Shuiskov(7837)
Practical Design Patterns for Java Developers by Miroslav Wengner(7735)
Test Automation Engineering Handbook by Manikandan Sambamurthy(7698)
Angular Projects - Third Edition by Aristeidis Bampakos(7182)
The Art of Crafting User Stories by The Art of Crafting User Stories(6635)
NetSuite for Consultants - Second Edition by Peter Ries(6551)
Demystifying Cryptography with OpenSSL 3.0 by Alexei Khlebnikov(6324)